2850-106. ST25 
SEQUENCE LISTING 

<110> Yokoyama, Shigeyuki 
Shirouzu, Mikako 
Sakamoto, Ayako 
Sakamoto, Kensaku 

<120> METHOD OF EXPRESSING PROTEINS COMPRISING NON-NATURALLY-OCCURRING 

amino acids (as Amended) 

<130> P/2850-106 

<140> 10/532,948 
<141> 2005-11-10 

<150> PCT/JP03/14028 
<151> 2003-10-31 

<150> 2002-318846 
<151> 2002-10-31 

<160> 34 

<170> Patentln version 3.3 

<210> 1 

<211> 167 

<212> DNA 

<213> Artificial 

<220> 

<223> an artificial base sequence consisting of a leader sequence of 
human tRNA gene, and the tRNAtyr gene of B. stearothermophilus 
with a CUA anticodon, but without the terminal CCA sequence and a 
transcription terminator 

<400> 1 

agcgctccgg tttttctgtg ctgaacctca ggggacgccg acacacgtac acgtcggagg 60 

ggtagcgaag tggctaaacg cggcggactc taaatccgct ccctttgggt tcggcggttc 120 

gaatccgtcc ccctccagac aagtgcggtt tttttctcca gctcccg 167 



<210> 2 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used for amplifying a part of genomic DNA of E. coli 

<400> 2 

ggaattccat atggcaagca gtaacttgat taaacaattg caag 44 



<210> 3 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used for amplifying a part of genomic DNA of E. 
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coli . 



<400> 3 

gccgaagctt gtcgactttc cagcaaatca gacagtaatt ctttttaccg 



<210> 4 

<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used in the overlapping extension in the present 
invention 



<210> 5 

<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used in the overlapping extension in the present 
invention 



<220> 

<221> misc_feature 

<222> (16) . . (16) 

<223> m represents c or a 

<220> 

<221> misc_feature 

<222> (17) . . (18) 

<223> n is a, c, g, or t 



<210> 6 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used in the overlapping extension in the present 
invention 



<400> 4 

aggatcgaag ccgcaagcga gcgcgatcgg gccttgcgcc 



40 



<400> 5 

aggatcgaag ccgcamnnga gcgcgatcgg gccttgcgcc 



40 



<400> 6 

acggtgtggt gctgtctatt ggtggttctg acc 



33 



<210> 7 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> a PCR primer used in the overlapping extension in the present 
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invention 



<400> 7 

acggtgtggt gctggcaatt ggtggttctg acc 



33 



<210> 8 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used in the overlapping extension in the present 
invention 



<210> 9 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used in the overlapping extension in the present 
invention 



<210> 10 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> a PCR primer used in the overlapping extension in the present 
invention 

<400> 10 

ttcttcggat ccaaccagac tgcgccgcct tc 32 



<210> 11 

<211> 30 

<212> DNA 

<213> Artificial 



<223> a PCR primer used in the overlapping extension in the present 
invention 

<400> 11 



<400> 8 

acggtgtggt gctgaacatt ggtggttctg acc 



33 



<400> 9 

acggtgtggt gctgtgcatt ggtggttctg acc 



33 



<220> 



gatcatctgg ttaacggaga agtgtttgcc 



30 



<210> 12 
<211> 26 
<212> DNA 



<213> Artificial 
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<220> 

<223> a PCR primer used in the overlapping extension in the present 
invention 

<400> 12 

gaccttcctg tgcgatattg gcaaac 26 

<210> 13 

<211> 12 

<212> DNA 

<213> Artificial 

<220> 

<223> the box A consensus sequence 
<220> 

<221> misc_feature 

<222> (2).. (2) 

<223> r represents g or a 

<220> 

<221> misc_feature 

<222> (5).. (6) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (10) . . (10) 

<223> n is a, c, g, or t 

<400> 13 

trgcnnagyn gg 12 

<210> 14 

<211> 11 

<212> DNA 

<213> Artificial 

<220> 

<223> the box B consensus sequence 
<220> 

<221> misc_feature 

<222> (8).. (8) 

<223> n is a, c, g, or t 

<400> 14 

ggttcgantc c 11 

<210> 15 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer binding site pbsl 

<400> 15 
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<210> 16 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer binding site pbs2 

<400> 16 

cgactacgat attcgcgcag 



<210> 17 

<211> 12 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a Bstxi-1 site 

<400> 17 
ccagcagact gg 



<210> 18 

<211> 12 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a Bstxi-2 site 

<400> 18 
ccagcttcct gg 



<210> 19 

<211> 63 

<212> DNA 

<213> Artificial 

<220> 

<223> a nucleotide sequence coding a short peptide used for 

substitution of green fluorescent protein (cyanfluorescent 
mutation) 

<400> 19 

atgggaacta gtccatagtg gtggaattct gcagatatcc agcacagtgg cggccgccgc 
gtc 



<210> 20 

<211> 11 

<212> DNA 

<213> Artificial 

<220> 

<223> another box B consensus sequence 
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<220> 

<2 2 1> mis c_f ea t u re 

<222> (8).. (8) 

<223> n is a, c, g, or t 

<400> 20 

agttcgantc t 11 



<210> 21 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer used for amplifying the sequence of SEQ ID 
No. 1 

<400> 21 

cacagaattc tcgggagctg gagaaaaaaa c 31 



<210> 22 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of another primer used for amplifying the sequence of 

SEQ ID NO. 1 
<400> 22 

cacaaagctt agcgctccgg tttttctgtg 30 



<210> 23 

<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer set used for amplifying a fragment having 
a primer binding site pbsl upstream of the sequence of SEQ ID No. 
1 and Bstxi-l site downstream thereof 

<400> 23 

agcgagtgtt aaccctgcct agcgctccgg tttttctgtg 40 



<210> 24 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer set used for amplifying a fragment having 
a primer binding site pbsl upstream of the sequence of SEQ ID No. 
1 and Bstxi-l site downstream thereof 

<400> 24 

acacacccag cagactggcg ggagctggag aaaaaaac 38 
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<210> 25 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer set used for amplifying a fragment having 
a Bstxi-l site upstream of the sequence of SEQ ID No. 1 and 
another Bstxi-l site downstream from the first Bstxi-l site 

<400> 25 

acacacccag cagactggag cgctccggtt tttctgtg 38 



<210> 26 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer set used for amplifying a fragment having 

a Bstxi-l site upstream of the sequence of SEQ ID No. 1 and 

another Bstxi-l site downstream from the first Bstxi-l site 

<400> 26 

acacacccag cttcctggcg ggagctggag aaaaaaac 38 



<210> 27 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer set used for amplifying a fragment having 
a BstXl-2 site upstream of the sequence of SEQ ID no. 1 and a 
primer binding site pbs-2 

<400> 27 

acacacccag cttcctggag cgctccggtt tttctgtg 38 



<210> 28 

<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of a primer set used for amplifying a fragment hving a 
BstXl-2 site upstream of the sequence of SEQ ID No. 1 and a 
primer binding site pbs-2 

<400> 28 

ctgcgcgaat atcgtagtcg cgggagctgg agaaaaaaac 40 



<210> 29 

<211> 424 

<212> PRT 

<213> Escherichia coli 



<400> 29 
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Met Ala Ser ser Asn Leu lie Lys Gin Leu Gin Glu Arg Gly Leu val 
1 5 10 15 

Ala Gin val Thr Asp Glu Glu Ala Leu Ala Glu Arg Leu Ala Gin Gly 
20 25 30 

Pro lie Ala Leu Tyr cys Gly Phe Asp Pro Thr Ala Asp Ser Leu His 
35 40 45 

Leu Gly His Leu val Pro Leu Leu Cys Leu Lys Arg Phe Gin Gin Ala 
50 55 60 

Gly His Lys Pro val Ala Leu val Gly Gly Ala Thr Gly Leu lie Gly 
65 70 75 80 

Asp Pro Ser Phe Lys Ala Ala Glu Arg Lys Leu Asn Thr Glu Glu Thr 
85 90 95 

val Gin Glu Trp val Asp Lys lie Arg Lys Gin val Ala Pro Phe Leu 
100 105 110 

Asp Phe Asp Cys Gly Glu Asn Ser Ala lie Ala Ala Asn Asn Tyr Asp 
115 120 125 

Trp Phe Gly Asn Met Asn val Leu Thr Phe Leu Arg Asp lie Gly Lys 
130 135 140 

His Phe Ser val Asn Gin Met lie Asn Lys Glu Ala Val Lys Gin Arg 
145 150 155 160 

Leu Asn Arg Glu Asp Gin Gly lie Ser Phe Thr Glu Phe Ser Tyr Asn 
165 170 175 

Leu Leu Gin Gly Tyr Asp Phe Ala Cys Leu Asn Lys Gin Tyr Gly Val 
180 185 190 

val Leu Gin lie Gly Gly ser Asp Gin Trp Gly Asn He Thr Ser Gly 
195 200 205 

lie Asp Leu Thr Arg Arg Leu His Gin Asn Gin Val Phe Gly Leu Thr 
210 215 220 

val Pro Leu lie Thr Lys Ala Asp Gly Thr Lys Phe Gly Lys Thr Glu 
225 230 235 240 



Gly Gly Ala val Trp Leu Asp Pro Lys Lys Thr Ser Pro Tyr Lys Phe 
245 250 255 
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Tyr Gin Phe Trp lie Asn Thr Ala Asp Ala Asp val Tyr Arg Phe Leu 
260 265 270 

Lys Phe Phe Thr Phe Met Ser lie Glu Glu lie Asn Ala Leu Glu Glu 
275 280 285 

Glu Asp Lys Asn Ser Gly Lys Ala Pro Arg Ala Gin Tyr val Leu Ala 
290 295 300 

Glu Gin val Thr Arg Leu Val His Gly Glu Glu Gly Leu Gin Ala Ala 
305 310 315 320 

Lys Arg lie Thr Glu Cys Leu Phe Ser Gly Ser Leu ser Ala Leu Ser 
325 330 335 

Glu Ala Asp Phe Glu Gin Leu Ala Gin Asp Gly val Pro Met Val Glu 
340 345 350 

Met Glu Lys Gly Ala Asp Leu Met Gin Ala Leu Val Asp Ser Glu Leu 
355 * 360 365 

Gin Pro Ser Arg Gly Gin Ala Arg Lys Thr lie Ala Ser Asn Ala lie 
370 375 " 380 

Thr lie Asn Gly Glu Lys Gin Ser Asp Pro Glu Tyr Phe Phe Lys Glu 
385 ~ 390 395 400 

Glu Asp Arg Leu Phe Gly Arg Phe Thr Leu Leu Arg Arg Gly Lys Lys 
405 410 ~ ~ 415 

Asn Tyr Cys Leu lie Cys Trp Lys 
420 

<210> 30 
<211> 135 
<212> DNA 
<213> Artificial 

<220> 

<223> a sequence of one of the induced expression systems prepared in 
Example 2 (TetBstO) 

<400> 30 

tctccctatc agtgatagag atcggagggg tagcgaagtg gctaaacgcg gcggactcta 60 

aatccgctcc ctttgggttc ggcggttcga atccgtcccc ctccagacaa gtgcggtttt 120 

tttctccagc tcccg 135 
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<210> 31 
<211> 145 
<212> DNA 
<213> Artificial 

<220> 

<223> a sequence of one of the induced expression systems prepared in 
Example 1 (TetBstl) 

<400> 31 

tctccctatc agtgatagag atccgtacac gtcggagggg tagcgaagtg gctaaacgcg 60 

gcggactcta aatccgctcc ctttgggttc ggcggttcga atccgtcccc ctccagacaa 120 

gtgcggtttt tttctccagc tcccg 145 

<210> 32 

<211> 155 

<212> DNA 

<213> Artificial 

<220> 

<223> a sequence of one of the induced expression systems prepared in 
Example 2 (TetBst2) 

<400> 32 

tctccctatc agtgatagag atccgccgac acacgtacac gtcggagggg tagcgaagtg 60 

gctaaacgcg gcggactcta aatccgctcc ctttgggttc ggcggttcga atccgtcccc 120 

ctccagacaa gtgcggtttt tttctccagc tcccg 155 

<210> 33 
<211> 85 
<212> DNA 

<213> Escherichia coli 
<400> 33 

ggtggggttc ccgagcggcc aaagggagca gactctaaat ctgccgtcac agacttcgaa 60 
ggttcgaatc cttcccccac caeca 85 

<210> 34 
<211> 85 
<212> DNA 

<213> Bacillus stearothermophilus 
<400> 34 

ggaggggtag cgaagtggct aaacgcggcg gactctaaat ccgctccctt tgggttcggc 60 
ggttcgaatc cgtccccctc caeca 85 
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